NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Unichain and Aperiodicity Are Sufficient for Asymptotic Optimality of Average-Reward Restless Bandits

https://doi.org/10.1287/moor.2024.0678

Hong, Yige; Xie, Qiaomin; Chen, Yudong; Wang, Weina (December 2025, Mathematics of Operations Research)

We consider the infinite-horizon, average-reward restless bandit problem in discrete time. We propose a new class of policies that are designed to drive a progressively larger subset of arms toward the optimal distribution. We show that our policies are asymptotically optimal with an [Formula: see text] optimality gap for an N-armed problem, assuming only a unichain and aperiodicity assumption. Our approach departs from most existing work that focuses on index or priority policies, which rely on the Global Attractor Property to guarantee convergence to the optimum, or a recently developed simulation-based policy, which requires a Synchronization Assumption.
more » « less
Free, publicly-accessible full text available December 11, 2026
Multi-task Representation Learning for Fixed Budget Pure-Exploration in Linear and Bilinear Bandits

Mukherjee, Subhojyoti; Xie, Qiaomin; Nowak, Robert D (August 2025, Reinforcement Learning Journal)

Free, publicly-accessible full text available August 8, 2026
Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning

Mukherjee, Subhojyoti; Hanna, Josiah P; Xie, Qiaomin; Nowak, Robert D (August 2025, Reinforcement Learning Journal)

Free, publicly-accessible full text available August 8, 2026
Stable Offline Value Function Learning with Bisimulation-based Representations

Pavse, Brahma S; Chen, Yudong; Xie, Qiaomin; Hanna, Josiah P (July 2025, Proceedings of Machine Learning Research)

Free, publicly-accessible full text available July 17, 2026
A Piecewise Lyapunov Analysis of Sub-quadratic SGD: Applications to Robust and Quantile Regression

https://doi.org/10.1145/3744970.3727269

Zhang, Yixuan; Huo, Dongyan Lucy; Chen, Yudong; Xie, Qiaomin (June 2025, ACM SIGMETRICS Performance Evaluation Review)

Motivated by robust and quantile regression problems, we investigate the stochastic gradient descent (SGD) algorithm for minimizing an objective functionfthat is locally strongly convex with a sub--quadratic tail. This setting covers many widely used online statistical methods. We introduce a novel piecewise Lyapunov function that enables us to handle functionsfwith only first-order differentiability, which includes a wide range of popular loss functions such as Huber loss. Leveraging our proposed Lyapunov function, we derive finite-time moment bounds under general diminishing stepsizes, as well as constant stepsizes. We further establish the weak convergence, central limit theorem and bias characterization under constant stepsize, providing the first geometrical convergence result for sub--quadratic SGD. Our results have wide applications, especially in online statistical methods. In particular, we discuss two applications of our results. 1) Online robust regression: We consider a corrupted linear model with sub--exponential covariates and heavy--tailed noise. Our analysis provides convergence rates comparable to those for corrupted models with Gaussian covariates and noise. 2) Online quantile regression: Importantly, our results relax the common assumption in prior work that the conditional density is continuous and provide a more fine-grained analysis for the moment bounds.
more » « less
Free, publicly-accessible full text available June 16, 2026
A Piecewise Lyapunov Analysis of Sub-quadratic SGD: Applications to Robust and Quantile Regression

https://doi.org/10.1145/3726854.3727269

Zhang, Yixuan; Huo, Dongyan Lucy; Chen, Yudong; Xie, Qiaomin (June 2025, ACM)

Free, publicly-accessible full text available June 9, 2026
Stable Offline Value Function Learning with Bisimulation-based Representations

Pavse, Brahma S; Chen, Yudong; Xie, Qiaomin; Hanna, Josiah P (June 2025, International Conference on Machine Learning (ICML),)

Free, publicly-accessible full text available June 13, 2026
Coupling-based Convergence Diagnostic and Stepsize Scheme for Stochastic Gradient Descent

https://doi.org/10.1609/aaai.v39i17.34035

Li, Xiang; Xie, Qiaomin (April 2025, Proceedings of the AAAI Conference on Artificial Intelligence)

The convergence behavior of Stochastic Gradient Descent (SGD) crucially depends on the stepsize configuration. When using a constant stepsize, the SGD iterates form a Markov chain, enjoying fast convergence during the initial transient phase. However, when reaching stationarity, the iterates oscillate around the optimum without making further progress. In this paper, we study the convergence diagnostics for SGD with constant stepsize, aiming to develop an effective dynamic stepsize scheme. We propose a novel coupling-based convergence diagnostic procedure, which monitors the distance of two coupled SGD iterates for stationarity detection. Our diagnostic statistic is simple and is shown to track the transition from transience stationarity theoretically. We conduct extensive numerical experiments and compare our method against various existing approaches. Our proposed coupling-based stepsize scheme is observed to achieve superior performance across a diverse set of convex and non-convex problems. Moreover, our results demonstrate the robustness of our approach to a wide range of hyperparameters.
more » « less
Free, publicly-accessible full text available April 11, 2026
Two-Timescale Linear Stochastic Approximation: Constant Stepsizes Go a Long Way

Kwon, Jeongyeol; Dotson, Luke; Chen, Yudong; Xie, Qiaomin (May 2025, Proceedings of Machine Learning Research)

Free, publicly-accessible full text available May 3, 2026
Two-Timescale Linear Stochastic Approximation: Constant Stepsizes Go a Long Way

Kwon, Jeongyeol; Dotson, Luke; Chen, Yudong; Xie, Qiaomin (January 2025, International Conference on Artificial Intelligence and Statistics (AISTATS), 2025)

Free, publicly-accessible full text available January 31, 2026

« Prev Next »

Search for: All records